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ABSTRACT 

The gene encoding the spcrm-specifie protein % from the sea encumber Hotoihuna tubules a has been cloned and 
characterized, Sea cucumber sperm chromatin displays a somatic-like histone complement and is beaded with a constant 
227 bp DNA linker length, second to the longest repeat of sea urchin chromatin. Protein a small basic protein 
reminiscent of the C-tcrminal tail of histone Hi, appears at the onset of spermiogenesis and accumulates in npe sperm. 
The $ 0 gene displays a coding frame interrupted by three large intervening sequences which combine to make it the longest 
for a sperm-spectfk protein yet reported tea. 17.7 kb) The identified gene is present as a single copy and, in turn, encodes 
a polyadenylaled transcript. The protein specified by this gene has 77 residues, reproducing unaltered the partial amino 
acid sequence of the protein previously determined. The structural arrangement and content of the gene are totally 
unrelated to cell-cycle regulated histone gene structure. Instead, it combines several features common to replication- 
independent genes coding for histone variants and even to protamine genes. Inference is made about the potential 
implications of this divergence in gene arrangement as regards chromatin transitions and modulation of gene activity 
occurring during spermiogenesis. 


RESUME 

Le gene eodant pour Ja proteine basique nucleaire specifique des spermatozoides chez 
Eholothurie 

Le gene cod ant pour la proteine basique nucleai re p tl specifique des spermatozoides chez rholothuric Holothuria tuhulosa 
a 6le clond et caractcrisc. La chromatine du spermatozoVde d'holothune presente un complement d*histone dc type 
somatique et forme un chapelet a vet un intervallc constant dc 227 paires de bases d'ADN, qui est la seeonde plus longue 
repetition de chromatine d’echinoderrne. La proteine $ [Jt une petite proteine basique qui rappelle rextremite C-terminale de 
] r hi stone HI, apparaft au debut de la spermiogenfese et s’aceumule dans les spermatozoides murs, Le gene comprend une 
region codante interrompue p.ar trois grandes sequences intercalaires qui se combinent pour en faire le gene le plus long 
connu actuellcmcnt pour une proteine specifique des spermatozoides (environ 17.7 kb). Le gene identifie est present en 
copie unique et code pour un transmit polyadenyle. La proteine $ l} specific par ce gene a 77 residus parmi Icsquels on 
re trou ve sans erreurs la sequence part idle decides amines precedemment determinee pour la proteine. L'organisation 
strueturale ct le content] du gfene $ 0 sont sans relations avec la structure des genes des histones regules par Ic cycle 
celluiaire. Au contraire, ce gene combine plusieurs daracteristiques communes aux genes independents de la replication 
codant pour des variants d 1 histones et meme aux genes de protamines. La consequence poterUielle de cette divergence dans 
L organisation des g&nes en ce qui conceme Jes transitions de la chromatine el la modulation de l 1 activity du gfene pendant 
la spenmiogen&se est discutee. 
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Eukaryotic chromatin is a macromolecular nucleoprotein assembly essentially composed of 
DNA and basic proteins. Histones, which are relatively conserved through evolution, are the 
genuine protein components in chromatin of non-proliferating cells and are involved in the 
organization of the chromatin fibre into various structural hierarchies (for full review see [42]). In 
stark contrast, DNA in male germ cell lineages appears to be bound by widely diverse basic 
proteins as regards chemical composition and number, giving rise to a seemingly striking variety 
of protein molecules along the different zoological groups [32], spanning from those species 
which retain histones that are close to the somatic types, to those in which they are fully displaced 
by more basic proteins like nucleoprotamines [7], The chemical data reported comprise a variety 
of vertebrate species, from fishes [9, 17] to mammals [25]. It is apparent that an enhanced 
basicity of ihe sperm proteins favours a tighter packaging of DNA. This requirement in 
spermatozoa seems obvious as a means of protecting the genomic complement. Likewise, it may 
also be an evolutionary adaptation of sperm to endure long-term storage and transport in the 
absence of DNA repair mechanisms. However, the functional significance of basic protein 
diversity in sperm and its effect at the molecular level are still ill-defined. DNA packing may not 
be the exclusive role. The actual existence of germ-line variants and sperm-specific protein types 
argues for more discriminating assignments such as fine-structural transitions of chromatin related 
to modulation of gene activity and its final quiescence during spermiogenesis [14], 

The variability of sperm nuclear proteins is of unknown origin. A hypothetical evolution of 
some of these proteins from histone HI has been put forward on the basis of the protein 
composition of the sperm of some bivalve molluscs [4]. Histone HI and its many subtypes 
constitute the most heterogeneous of the histone classes [41], Its largest evolutionary sequence- 
variation appears confined to both the N-terminal and C-terminal extensions of the molecules 
whereas the hydrophobic central globular core remains fairly well conserved [13]. This 
asymmetric organization has led to the suggestion that the carboxyl-terminus is involved in the 
higher order compaction of chromatin [1]. Models to test that assumption are provided by marine 
invertebrates, particularly echinoderms and molluscs, These organisms deserve particular mention 
since their somatic histones apparently coexist with both sperm-specific variants and protamine¬ 
like molecules, different from fish or mammalian protamines [3, 32, 43]. These protein molecules 
mostly fit into classes 3 and 4 of BLOCH’s cytochemieai categorization of nucleoproteins in 
mature sperm cells [7]. These two types have been ulteriorly combined into a rather 
heterogeneous group of intermediary sperm-specific types, plausibly representing transitions from 
histones to protamine-like molecules [32]. 

Although the prevalent cellular histones are encoded by a highly reiterated multigene family 
whose expression is tightly coupled to DNA replication, histone-variant genes tend to be present 
in single or few dispersed copies not subjected to cell cycle regulation [11, 23]. Already 
classification schemes based on regulatory correlations have been devised [43]. Nonetheless, very 
little is known about the organization of tissue-specific, variant-histone genes, along with the 
evolutionary origin oi nucleoprotamine and protamine-like genes [30]. 

It is important to address these questions and obtain new evidence concerning histone to 
profamine transitions and the genes that encode them, aiming to understand at the molecular level 
their differential function and its influence on the structural organization of sperm chromatin. Our 
work has been involved in the analysis of chromatin from the germinal tissue of the echi noderm 
Holothuria tubulosa. During sperm maturation there is no bulk replacement of the histone 
complement, transitions being restricted to the addition of a sperm-specific, arginine-rich HI 
variant [31] and the presence in ripe sperm of a small basic protein termed <|> 0 [40]. The latter has 
an amino acid composition reminiscent of the carboxy-terminal region of sea urchin Hl-S, 
provided that Arg is considered equivalent to Lys [2], Incorporation of protein 4> 0 into chromatin 
occurs in the terminal stages of spermiogenesis [10], representing about 4% of the histone moiety 
of the mature spermatozoa. Nucleosome organization remains invariable throughout sea cucumber 
spermatogenesis with a constant DNA linker length of 227 bp [15] consistent with sea urchin 
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nucleosomal repeats, which exhibit the longest lengths ever measured. The isolation and sequence 
determination of a cDNA for H ’ tuhuiosa protein <(> G have been previously reported [33]. In the 
present paper we describe the molecular cloning and characterization of the gene encoding this 
protein specific to the sea cucumber sperm chromatin. This is the first gene coding for a hisione- 
to-protamine transition protein to be identified. 

material and methods 

Living organisms, Male specimens of the sea cucumber Holothuria tuhuiosa were collected periodically oft the 
catalonian shore during the breeding season, moved live to the laboratory in cold seawater and held in 8 a G seawater until 
used. Excision of gonads and sperm collection were performed as detailed elsewhere [36], 

Isolation and purification of genomic DNA. High molecular weight genomic DNA was extracted from fresh sperm 
suspensions 'esse mi ally as described [15]. Briefly, suspensions were treated with proteinase K (50 qg/ml) overnight at 
37^C, After incubation, samples were deproteinized by successive phenol and chloroform extractions and the aqueous 
phases precipitated with ethanol. The DNA was further purified by Cesium chloride banding and subsequent dialysis. 

Construction and screening of a sea cucumber genomic library. For construction pf the H , tuhuiosa genomal 
library', purified sperm DNA was subjected to a partial digestion with htbal to generate fragments with itam HI-compatible 
overhangs and subsequently size-fractionated by sucrose gradient centrifugation DNA fragments in the size range 12-20 
kb were pooled and ligated to dephosphorylated lambda-based Charon 35 UCh35j replacement vector [26] linearized with 
ZtamHL Ligation reactions were carried out at a vector to insert molar ratio of 2:L Recombinant phages were encapsidaicd 
and used to transform E, coli 555 recA~ cells yielding a litre of 3x10 s plaque-forming-units (pfu) per jug of ligated DNA. 
The genomal library was screened by in situ plaque hybridization [5], Plaques at a densiLy of Hr pfu were replicated onto 
nitrocellulose membranes and screened with a 441 bp long N, tuhuiosa cDNA done [33] labelled by random-priming 
with the Klenow enzyme [21 ]. 

Positive plaques were purified by plating at decreasing densities and the isolated phages were grown by cascade 
infection and banded onto ethidium bromide-containing CsCl gradients. The resulting DNA was purified by phenol and 
chloroform extractions and used for further analysis. All recombinant DNA manipulations were carried out by standard 
procedures [37] and conducted in accordance with established guidelines Tor recombinant DNA research. 

Restriction analysis and Southern transfers. DNA from positive recombinant clones was digested with selected 
endonucleases and pairwise combinations thereof. Where required, restriction fragments were electrophoresed on agarose 
gels, transferred to nylon membranes by alkaline blotting [34] after partial depurination and screened with six different 
regions of the 4 0 -cDNA cloned insert as probes, 

<P 0 . gene number. To assess the copy number of the gene in the haploid sperm genome of H. tuba torn, genomic 
DNA was digested, independently or in combination, with various endonucleases no! cleaving inside the t^-cDNA 
sequence. Restriction fragments were subsequently electrophoresed on 0.5% agarose gels, blotted onto a. nylon membrane, 
and hybridized to labelled probes prepared from the $ d -cDNA done. The ^ gene number was derived by comparison of the 
number and intensity of the autoradiographic signals measured by densitometric analysis, with those from graded amounts 
of the cloned 4 ^-cDNA equivalent to integer-copies per haploid genome of the cDNA sequence. Standards of cDNA were 
clectrophoresed in parallel, supplemented with a mass excess of sheared calf thymus DNA to compensate for the amount of 
resineled sea cucumber sperm DNA loaded on each gel slot. 

Plasmid subdoning and nucleotide sequence analysis. Genomic DNA restriction fragments of appropriate size 
identified by hybridization analysis as containing <> 0 -cDNA sequence tracts were excised off the gel T purified further on 
low-melt agarose, and ligated into the phagemid vector Bluescript + SK Chimeric recombinants were used to transform 
competent E. colt XL 1-blue recA~ cells and were selected as AmpRTcR:Lac‘ phenotypes. DNA inserts from the 
recombinant plasmids were sequenced by the dideoxy chain-termination procedure [38] using the Sequenase sysLem with 
forward and reverse primers for both orientations, Computer analysis was performed using the MieroGenie sequence 
analysis software {Beckman, USA), 


RESULTS AND DISCUSSION 

Spermatogenesis in the sea cucumber H . tuhuiosa is a rather simple process. Chromatin 
from ripe spermatozoa retains the five somatic-type histones in normal relative amounts 
accompanied by the highly basic protein (average M r - 8640), structurally related to histone 
HI [2], We have previously reported the characterization of a clone carrying a full length <p 0 
transcript, isolated from a cDNA expression library made from the poly (A + ) fraction of total 
RNA extracted from immature gonadal tissue and screened with polyclonal anti-<|> 0 antibodies 
[33], The 441 bp cloned cDNA encompassed a continuous open reading frame for a basic 
polypeptide of 77 residues whose sequence conformed to the partial amino acid sequence of d> 0 
previously established. Likewise, poly (A + ) selected RNA yielded a product electrophoretically 
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comigrating with protein <j) a upon in vitro translation in wheal genn, cell-free extracts, whereas 
Northern blot analysis detected only a 0.6 kb (j> 0 -mRNA transcript homologous to the cDNA 
probe. 

Isolation of the sea cucumber gene 

The genomal library of H. tubulosa sperm DNA cloned into the ACh35 vector was screened 
by hybridization with the <t> 0 -cDNA cloned insert. The screening of about 250 000 plaques 
yielded four positive clones with inserts of -16.5, 14.4. 13 and td.5 kb, named AHt7, AHt2, 
AHtl and AHt8, respectively, The four recombinants were next subjected to endonuclease 
restriction and Southern blot analysis using six cDNA-derived probes encompassing specific 
regions of the <|) 0 -cDNA clone: (i) the two asymmetric segments resulting from the cleavage along 
the EcoRl site internal to the 0 o -cDNA insert, namely, the 248 bp leader (S probe) and the 193 bp 
trailer (3' probe) fragments, encoding the first 69 and last 8 amino acids of protein <j> 0 , 
respectively; (ii) the 81 bp long EcoRl-Pstl fragment comprising the 5 r -flanking region and the 42 
bp sequence coding for the 14 amino acids inclusive of the initial methionine residue heading the 
NTenninus (P probe); (iii) the Ddel-EcoRl fragment of 8! bp representing amino acids 43 to 69 
(D probe); (iv) the two segments of 113 bp and 80 bp in length (XI and X2 probes) resulting 
from the nibbling of the d’-probe at the single Xbal site. Probe XI contained the sequence for the 
last 8 amino acids ot the C-terminus plus 88 bp of the adjacent downstream extension whereas 
probe X2 consisted ol the final 80 bp of the 3'-noncoding region of the 4> 0 -cDNA clone. Those 
restriction fragments shown to carry 4> e> sequences were subsequently subcloned for further DNA 
sequence analysis. 
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Feg. 1, Isolation and characterization of the Holothuria tubulosa gene. The restriction endonuclease map and 
organization ol the $ 0 gene are shown. The four positive isolates picked out from the sperm DNA library of 10-20 
kb Mbol partials cloned into the tiawH] replacement vector ACh35, were digested with several endonucleases and 
combinations thereof. Restriction fragments carrying ^ 0 -cDNA sequences were identified by hybridization. 
Positively reacting fragments were lunher purified, subcloned into the phagemtd Blucseript + SK and sequenced by 
the dideoxy chain-termination procedure of Sanger. Filled boxes indicate the relative positions of exons encoding 
tfn sequences consecutively numbered 1 to 4, and open boxes the 5' and 3' flanking regions homologous to the 
noncoding extensions of the cloned <j> p -cDNA- The hatched rectangular boxes highlight those regions of the sene 
dial were completely sequenced either on both DNA strands or repeatedly in one direction. B , D. £\ H , P, 5/, and 
A denote Bgt f. AM, EccRl Hindlll Pst I, Sail $stl, and Xha\ restriction sites, respectively. The thin lines depict 
the positions of the lour strongly hybridizing genomic clones uHl 1 - 8) used to map the * 0 gene. 
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The results of the restriction mapping are shown in Fig. 1 and can be summarized as 
follows. The entire i|> 0 gene appeared split into four distinct exon sequences scattered along the 
lengths of the clones. The latter displayed partial overlaps differing in extension. AHt7 harboured 
the 5’-proximal exons l, 2 and 3. The central exons 2 and 3 were also present in AHt2 and AHt8. 
In addition, the former contained the first 79 nucleotides of the fourth exon whereas the latter 
spanned its entire coding sequence extending 831 nucleotides beyond the stop codon. The 
shortest clone AHtl carried only exon 1 centrally positioned within the genomic DNA insert. 

On the basis of these results it was feasible to correlate the four recombinants and to 
conclude that the coding sequence of the sea cucumber 4> 0 gene is interrupted by three long 
intervening sequences which amount to 16.2 kb in total length (see Fig. 1). These unusually large 
introns combine to make the <j> 0 gene the longest for a sperm-specific protein {cci. 17.7 kb) so far 
reported. The overall organization of this sea cucumber gene appears to significantly diverge from 
that of the intron-less histone genes [27J and also from the arrangement of mammalian, singie- 
intron protamine genes [30]. 

Sequence analysis of the <j> 0 gene 

The nucleotide sequence of the sea cucumber 0 O gene is shown in Fig. 2. The coding 
sequence is discontinuous and encompasses an open reading frame For a basic protein of 77 
residues, interrupted by three introns involving canonical splice junctions [24], specifically those 
assigned to invertebrates [22]. The first and second introns of 6.8 and 4 kb in respective length, 
are inserted within codons 9 and 15, respectively. The third intron is 5.4 kb long and is 
positioned contiguous to codon 41. The complete coding sequence of the <j> 0 gene is identical to 
that of the cDNA clone previously reported [33] but for six nucleotide substitutions (97.9% 
homology). Five changes are conservative since they correspond to third-base degenerates of the 
most common synonym triplets, involving C for T and A for G conventional exchanges with no 
ensuing alteration of the assigned amino acid. The only relevant nucleotide substitution occurs at 
codon 61 and involves a G for A replacement in the first base causing a change of amino acid 
assignment. The overall level of sequence conservation observed argues for a stable organization 
of this gene. 

The deduced primary structure of the encoded protein corresponds exactly with the <f> 0 
sequence specified by the cDNA clone, exclusive of the noted single alteration at codon 61 
(98.7% homology). The cDNA sequence contains the triplet GCC for alanine in this position 
while the ACC counterpart in the gene codes for threonine. Most likely this difference arises from 
the DNA polymorphism detected in echinoderms [12] which is reflected in the well-documented 
intraspecific microheterogeneity found in a substantial variety of sperm variant proteins from 
marine invertebrates [28] as well as in the protamines of trout [18]. 

The close sequence homologies between the q 0 -cDNA and the cloned gene are endorsed by 
the identity of the deduced protein sequences which, in addition, reproduce wholly unaltered the 
partial amino acid sequence of protein 6 0 previously established. The coincident similarities 
observed sustain the conclusion that, in actuality, the cloned gene encodes the sperm-specific 
protein <]> 0 in H. tubulosa. 

Comparison of the nucleotide sequences in the noncoding regions of the <b 0 gene with those 
of replication-dependent histone genes reveals that the former lacks the conserved motifs defining 
the S-phase regulated histone gene structure such as the downstream hairpin loop sequence at its 
3‘ proximal purine-rich tract required for 3' processing of the histone transcripts [6]. Instead, the 
leader and trailer regions surrounding the <j> 0 gene combine several structural elements found in 
both replication-independent histone variant as well as protamine genes (see Fig. 2). 

The region upstream of the initiation codon contains an atypical TATA motif identical to the 
TTCAAA box identified in the cell-cycle independent H2Ap histone gene that codes for an 
extreme H2A variant in chicken, whose transcript is polyadenylated [16]. Significantly, both 
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elements are found similarly positioned 144 nucleotides upstream from the initiator triplet. There 
is another potential TATA motif with perfect homology to the non-canonical TTAAAT element 
present in both the chicken and duck H5 genes [19, 35]. This sequence starts at position -205, 65 
and 28 nucleotides further upstream than the mentioned homologues, respectively. Another 
general feature required for promotion of transcription by RNA polymerase II is a CAAT 
sequence often located between 70 to 90 bp upstream of TATA sites [8]. In this regard, the leader 
region of the gene displays a potential CAAT motif (-221 to -218) located 73 nucleotides 
upstream relative to the H2Ap-like TTCAAA box. Another motif shared with leader regions in 
protamine genes is the TGACGTCA sequence found far upstream in the <t> 0 gene (-489 to -482). 
This m-acting element, usually referred to as a cAMP regulatory element (CRE), is strictly 
conserved in all protamine genes and it is considered essential for the biological activity of cAMP- 
regulated enhancers [30], Since spermatogenesis in echinoderms is known to be under hormone 
control probably involving cAMP [39]. such a regulatory signal might well represent a link 
between hormonal signals and the expression of sperm-specific genes such that of protein p Q , 

The downstream extension of the <|> 0 coding sequence is devoid of the highly conserved 
structural features of the re plication-dependent histone genes required for the 3' end formation of 
histone transcripts. Instead, similarities with the equivalent regions in protamine genes are 
encountered. First, three potential polyadenylation signals are present, starting at positions 306, 
392 and 396 respectively, 3' to the TAA stop codon. The last two elements consist of the 
hep tamer AAATAAA which appears repeated with a trinucleotide overlap. This heptameric 
sequence motif bears a perfect homology with the conserved polyadenylation signal found in the 
protamine genes from salmon and trout [29]. Nonetheless, the significance of the close 
similarities encountered between the organization of the <> 0 gene and that of genes coding for 
extreme histone variants or even for protamines remains to be unambiguously defined. 

Genomic content of the 0 O gene 

The copy number of the <j> 0 gene was determined by Southern blot hybridization analysis of 
sperm DNA restriction digests with the 81 bp long EcoRI-Pa/I fragment (probe P) of the <(> 0 - 
cDNA clone comprising the 5'-flanking extension and the initial 42 bp of the coding sequence, 
labelled by random priming. A set of endonucleases lacking cleavage sites within the cDNA 
sequence was selected for the single and double enzyme digestions. DNA restriction fragments 
were electrophoretically resolved in conjunction with varying amounts of the cloned tfe-cDNA 
insert, diluted with a mass excess of sheared heterologous DNA to make up for those of restricted 
sperm DNA loaded on the gel lanes. The amounts of the cDNA standards, equivalent to one and 
four copies of the respective sequence per haploid genome, were inferred from the DNA content 
ot the haploid genome (C-value = 3 x 10 9 bp) of H. tubulosa sperm previously determined [36], 

Hybridization patterns from both single and combined enzyme restrictions yielded in every 
case only one size class of DNA fragment positively hybridizing with the cDNA probe (Fig. 3). 


Flo. 2. DNA sequence of the sea cucumber gene and Hanking regions. The 5 -+3" nucleotide sequence of the non- 
t ran scribed strand (i.e. mRNA-like) is given along with the amino acid sequence derived from the <|i 0 coding region, 
shown above the nucleotide sequence. The abbreviation ini and the asterisk mark the respective positions of the 
initiator and the stop codons. The noncoding leader and trailer extensions are numbered with negative and positive 
numerals begminng at nucleotides 5' adjacent to the initiation codon and 3‘ proximal to the Stop triplet, 
respectively. Coding triplets are denoted with numbers in italics. The slash symbols mark the normal donor splice 
site junctions. Positions ot the putative CRE. CAAT and TATA elements as well as the poly(A) addition signal 
discussed in ihe text, are doubly underscored. Most of the extensive intron sequences have been removed for 
clarity. 
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Exon 1 

~^DG -650 

GACTGCTCACGAGTGATAGCGCCCAAAeAAGCGTCCAAGAGGCCGAACAGACGCTAAGGCTCGCTACGGCGCGTTAGCAT 

-600 

GT AC AAAA T T GCCT C GG C GTTTTC AC T AG T AT ACGC GTTTC T AC GGC GATGGAATC G ACTTG AGT AAG ("‘TG AC C T AC ATC 

- 550 . , 500 

ACGTGACCTTCCATGA'rGGCTTCACAACAACACTGTGGGATGCGCCTTCCTGTTATATAGTTCAGTGGAGCCGATGCTTA 

“450 

T G AC G TC AT AATT TGGT GT AG AAG T C T AGT C C AC TGC AT ATTTCC AAACGTG AAG AAACGGTTTG A G AC C G AATC AAATG 

- 4.00 -3 50 

catttttcttaccagcttaccgcgctctggaatacgatgagtatgtatgcaatcctcagttcaaagcaaagtggacgcac 

-300 - 25 Q 

TA CGTTTGGACAGTG T AC CT AGC C T AC ATT ATTC ATG TT CTTGTT CCCT TT ATG CA CGTAAGGAC GCAAC CCG AT GC GAC 

-2 0 0 

GTT T C TG AAAC GGG C AC TT AG ACG C GC Gg^^TTGT AA T CACGTT TAAAT CAT AG T AT T TGG T G CGC GT AGC TAT AG GTG 

-150 ^100 

C GTT T ATGCGC C TC A C T T T GT AAAAXX££y^AAAA'T AA C AAAT AT TG T TT T C AG T T T T T AAAG AAC CG^tAC T T GAT C GC 

-50 

ttacagccgagcaactaagacgttggtcctacgcactgcccagttttgattcccccttgtgtcggaaattccaactagaa 

-1 5 

TCAATAATC ATG GTA GCC AGA CGA CAA ACA AAG AAA G/gTAAATAAAGGGAACAATATATGCAAGGCGTT 
ini Val Aid A rg Arg Gin Thr Lys Lys A 


Exon 2 


10 15 

TTTTATTTCTTCTTTCTCACTTCACAG CT AGG AAG CCT GCA GCC AG/GTGAGTGAATACAATTTAAATTTTAT 

id Arg Lys Pro Ala Ala Ar 


Exon 3 


20 25 

TAACTCAAGACTTTAAATGCTTTGTCCTTCCTCCCCAG G AGA CGC AGC GCA GCC AAA CGC GCA GCC CCA 

g Arg Arg Ser Ala Ala Lys Arg Ala Ala Pro 


30 35 40 

GCC GCG AAG AAG GCT GCG AGT CGC CGT CGC CCA AAG AGT GCT AAG AAG/GTAGGTAATAAGATGT 

Ala Aid Lys Lys Ala Ala Ser Arg Arg Arg Pro Lys Ser Ala Lys Lys 


Exon 4 


45 50 

AAT T AAAAAAAGT C C TG AC AAT AT ATT T TT C TCTTTTC AG GCT AAG CCC GCA GCA AGG AGG CGC AGC AGC 

Ala Lys Pro Ala Ala Arg Arg Arg Ser Ser 


55 60 65 70 

GTC AAA CCT AAA GCA GCA AAA GCA GCC ACC CAA GTC CGT GGC AGG AGC CGA CGA ATT CGC 

Vdi Lys Pro Lys Ala Ala Lys Ala Aid Thr Gin Val Arg Arg Arg Ser Arg Arg He Arg 


75 +1 +50 

CGT GCG TCC GTG TCA AAG TAA TTC AATG G AAG AC TG ATC ATT AAAT C GTA AC C C CTTCG AAAG ATT AA A CT T A 

Arg Ala Ser Val Ser Lys * 

+100 

tcaaatttcattttgtagaactgtccaaattttctagaatattgcagaactgaacatttaaaacacatccaaattcgtaa, 

+150 +200 

GC G AACAAG C AAGCAAC GAT GAC CT AC AAT T TACAGTC GTTTCTT AT TAT TTC AAGTTTG CCT TT ATTC AG T TTC AGTT T 

+ 2 50 

cagtttatttactctttaatacctcctcggaggtgtcagagtcaaaacatacaattagatacacagaaatatacaaaaag 

^300 +350 

C AGC AAGATCAAC AAT AAAC AAAAAC AAAAAAC AAAAAT CAT GC AAGC AAATC AG TAC AAT C AAAAAAC T AACT T C AACC 

+400 +450 


+ 500 

T ACG C AC AAC AC AAC GAG C AACC T AC G T AT GC AC TAATGCCTAGCTAT AC AC ACT AC AT CATC AAAT C AAC T ACGG C AC T 

+550 - 6 QQ 

C ATT AAC GAT GAT AT AC ACT AC AG AAGTGGC C AG G T TTC TTTGCAC CT CCT TATTTCT AG AC GTT AAAAGC T GGGCC ATT 

+ 65 0 

TT T AAAG T AT T T GGAC C C AC CC TATAATAATGCTT CAAATA G TATTTC CGA T CTGAGTTGAAAG C C TTACAT TGAAAAAG 

+700 +750 

AT AATG AAATT AC ATC T C C T ATC AGC C C AC T AG T AC AAAG T T C AC AAAGTCTA T CAT T AAAGT C AAT ATT AAG AAAT C GT 
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In turn, each singly-reacting fragment appeared to hybridize with the probe lo a similar extent as 
revealed by the intensity of the autoradiographic signals estimated from densitometer tracings. 
Comparison of the level of hybridization of the genomic fragments with the quantified signal 
intensities of the cDNA standards yielded an average value of 0.92 copies of the <f> 0 gene per 
haploid genome. These results indicate that the H. lubulosa sperm <j 0 protein is probably specified 
by a single copy gene. 
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FtG. 3. — H. tubulosa cs 0 gene copy 
number. Autoradiogram of 
Soul hern blots of sea cucumber 
sperm DNA restrict ion digests 
hybridized with P- pro be of the 
cDNA (see [ext for details); Sperm 
DNA samples (20 fig) were 
digested with: lanes I lo 6 Bam HI; 
ffmdlll; Kpnl: Bam HI + //mdlll; 
Bam HI + Kpnl; HitnUll + Kpt il, 
respectively. Graded amounts of 
the tj> 0 -cDNA cloned insert 
equivalent to 4 (lane 7) and 1 
copies (lane S) per haploid 
genome, supplemented with a 
mass excess of sheared calf thymus 
DNA were co-eleetrophoresed as 
hybridization standards. Sizes of 
restriction fragments from the * 0 - 
cDNA clone run in parallel as 
migration markers* are given in bp 
(lane m). 
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The structural arrangement and content of the 0 f1 gene closely coincide with the common 
features ol a sizable number of post-meiotically expressed genes, typified by those encoding most 
hislone-to-protamine transition proteins as well as protamine genes [20, 23]. Besides being 
expressed in a replication-uncoupled manner, these genes usually generate polyadenylated 
transcripts and most of them, although not all, are present as single copies containing coding 
regions often interrupted by intervening sequences. The overall organization of these genes 
becomes clearly divergent from that of the somatic histone genes. The functional implications, if 
any, of this divergence in gene arrangement remains unexplained, although potential correlations 
with chromatin transitions related to modulation and final arrest of gene activity during the 
spenruogenic process should be taken into account. Further studies are underway to characterize 
new genes encoding known sperm-specific protein variants. 
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